Predicting Protein Function using Decision Tree

Authors

  • Parminder Kaur Wadhwa
  • Surinder Kaur
Abstract

The drug discovery process starts with protein identification because proteins are responsible for many functions required for the maintenance of life. Protein identification in turn requires determination of protein function. The proposed method develops a classifier for human protein function prediction. The model uses a decision tree for classification. The protein function is predicted on the basis of matched sequence-derived features for each protein function. The research work includes the development of a tool which determines sequence-derived features by analyzing different parameters. The other sequence-derived features are determined using various web-based tools.

Keywords: sequence-derived features, decision tree.

I. PROTEINS AND THEIR ROLE

Proteins are the primary components of living things, and they play many roles. Proteins are the molecular machinery that regulates and executes nearly every biological function [4]. Proteins provide structural support and the infrastructure that holds a creature together; they are enzymes that make the chemical reactions necessary for life possible; they are the switches that control whether genes are turned on or off; they are the sensors that see and taste and smell, and the effectors that make muscles move; they are the detectors that distinguish self from nonself and create an immune response. Proteins have a variety of roles that they must fulfill:

  • They are the enzymes that rearrange chemical bonds.
  • They carry signals to and from the outside of the cell, and within the cell.
  • They transport small molecules.
  • They form many of the cellular structures.
  • They regulate cell processes, turning them on and off and controlling their rates.

Despite their radical differences in function, all proteins are made of the same basic constituents: the amino acids.
Each amino acid shares a basic structure, consisting of a central carbon atom (C), an amino group (NH2) at one end, a carboxyl group (COOH) at the other, and a variable side chain (R), as shown in Fig. 1. Chains of amino acids are assembled by a reaction that occurs between the nitrogen atom at the amino end of one amino acid and the carbon atom at the carboxyl end of another, bonding the two amino acids and releasing a molecule of water. The linkage is called a peptide bond, and long chains of amino acids can be strung together into polymers, called polypeptides, in this manner. All proteins are polypeptides. When a peptide bond is formed, the amino acid is changed (losing two hydrogen atoms and an oxygen atom), so the portion of the original molecule integrated into the polypeptide is often called a residue. The sequence of amino acid residues that make up a protein is called the protein's primary structure.

Fig. 1 Basic Chemical Structure of Amino Acid

Manpreet Singh is with the Department of CSE & IT, Guru Nanak Dev Engineering College, Ludhiana. Parminder Kaur Wadhwa is with the Department of CSE & IT, Guru Nanak Dev Engineering College, Ludhiana. Surinder Kaur is with the Department of CSE, Institute of Engineering and Technology, Bhaddhal, Ropar.

II. PROTEIN FUNCTION

The definition of biological function is ambiguous, and the exact meaning of the term varies with the context in which it is used. The biological function of a protein has more than one aspect. Take for example a protein kinase: in the biochemical aspect, a kinase's function would be the phosphorylation of a hydroxyl group of a specific substrate. The scope of interest implied by this definition does not require any more than a 'disembodied' protein performing alone in vitro.
However, proteins perform their function within an organism, and this has consequences ranging from the subcellular to the whole-organism level. In a physiological aspect, the same kinase may be part of a signaling pathway, where a protein both phosphorylates, and is phosphorylated by, interacting partners. A mutation in this kinase might cause a disease, so yet another aspect is a phenotypic or medical one. Therefore, when speaking of a protein's function, we must always specify the aspect or aspects of the functional description [2].

Manpreet Singh, Parminder Kaur Wadhwa, and Surinder Kaur, "Predicting Protein Function using Decision Tree," World Academy of Science, Engineering and Technology 39, 2008.
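The abstract describes predicting protein function from sequence-derived features with a decision tree. The following is a minimal sketch of that general idea, not the authors' tool: the three features (sequence length, mean Kyte-Doolittle hydropathy, fraction of charged residues), the helper names, the toy sequences, and the labels "hydrophobic" and "charged" are all assumptions made for illustration and do not come from the paper. The tree is a small hand-written CART-style learner using Gini impurity, chosen only so the example needs no external libraries.

```python
from collections import Counter

# Kyte-Doolittle hydropathy scale (standard published values).
HYDROPATHY = {
    "A": 1.8, "R": -4.5, "N": -3.5, "D": -3.5, "C": 2.5, "Q": -3.5, "E": -3.5,
    "G": -0.4, "H": -3.2, "I": 4.5, "L": 3.8, "K": -3.9, "M": 1.9, "F": 2.8,
    "P": -1.6, "S": -0.8, "T": -0.7, "W": -0.9, "Y": -1.3, "V": 4.2,
}

def features(seq):
    """Illustrative sequence-derived features (an assumed feature set)."""
    seq = seq.upper()
    n = len(seq)
    counts = Counter(seq)
    mean_hydropathy = sum(HYDROPATHY[a] for a in seq) / n
    charged_fraction = sum(counts[a] for a in "DEKR") / n
    return [n, mean_hydropathy, charged_fraction]

def gini(labels):
    """Gini impurity of a list of class labels."""
    n = len(labels)
    return 1.0 - sum((c / n) ** 2 for c in Counter(labels).values())

def best_split(rows, labels):
    """Return (weighted impurity, feature index, threshold), or None."""
    best = None
    for f in range(len(rows[0])):
        values = sorted({r[f] for r in rows})
        # Candidate thresholds are midpoints between distinct values.
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            score = (len(left) * gini(left) + len(right) * gini(right)) / len(labels)
            if best is None or score < best[0]:
                best = (score, f, t)
    return best

def build_tree(rows, labels):
    """Leaves are labels; internal nodes are (feature, threshold, left, right)."""
    if len(set(labels)) == 1:
        return labels[0]
    split = best_split(rows, labels)
    if split is None:  # identical feature vectors with mixed labels
        return Counter(labels).most_common(1)[0][0]
    _, f, t = split
    left = [(r, y) for r, y in zip(rows, labels) if r[f] <= t]
    right = [(r, y) for r, y in zip(rows, labels) if r[f] > t]
    return (f, t,
            build_tree([r for r, _ in left], [y for _, y in left]),
            build_tree([r for r, _ in right], [y for _, y in right]))

def predict(tree, row):
    """Walk from the root to a leaf and return its label."""
    while isinstance(tree, tuple):
        f, t, left, right = tree
        tree = left if row[f] <= t else right
    return tree

# Toy training data: invented sequences with placeholder labels.
TRAIN = [
    ("LLIVAVLLGFAIVL", "hydrophobic"),
    ("AILVFLLGAVIAWL", "hydrophobic"),
    ("DEKRDEKKRSEDNK", "charged"),
    ("KKDEERSDKEQRDE", "charged"),
]
TREE = build_tree([features(s) for s, _ in TRAIN], [y for _, y in TRAIN])

if __name__ == "__main__":
    print(predict(TREE, features("VLLIAGFLAVLLIA")))
```

In practice the feature vector would hold whatever sequence-derived features the chosen tools report, and the labels would be curated function classes; a maintained library implementation of decision trees would normally replace the hand-written learner.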

Similar resources

Provide a Predictive Model to Identify People with Diabetes Using the Decision Tree

Background: Today, in most hospitals in Iran, there is an extensive database of patient characteristics that includes a large amount of information related to medical, family and medical records. Finding a knowledge model of this information can help to predict the performance of the medical system and improve educational processes. Methods: Data mining techniques are analytical tools that are...

Predicting The Type of Malaria Using Classification and Regression Decision Trees

Maryam Ashoori (School of Technical and Engineering, Higher Educational Complex of Saravan, Saravan, Iran), Fatemeh Hamzavi (School of Agriculture, Higher Educational Complex of Saravan, Saravan, Iran). Abstract. Background: Malaria is an infectious disease infecting 200-300 million people annually. Environme...

Predicting Twist Condition by Bayesian Classification and Decision Tree Techniques

Railway infrastructures are among the most important national assets of countries. Most of the annual budget of infrastructure managers is spent on repairing, improving and maintaining railways. The best repair method should consider all economic and technical aspects of the problem. In recent years, data analysis of maintenance records has contributed significantly to minimizing the costs. B...

Determining Factors Influencing Length of Stay and Predicting Length of Stay Using Data Mining in the General Surgery Department

Background: Length of stay is one of the most important indicators in assessing hospital performance. A shorter stay can reduce the costs per discharge and shift care from inpatient to less expensive post-acute settings. It can lead to a greater readmission rate, better resource management, and more efficient services. Objective: This study aimed to ident...

Predicting the Risk of Osteoporosis Using Decision Tree and Neural Network

Introduction: Osteoporosis is one of the major causes of disability and death in elderly people. The objective of this study was to determine the factors affecting the incidence of osteoporosis and provide a predictive model to accelerate diagnosis and reduce costs. Method: In this fundamental descriptive study, a new model was proposed to identify the factors affecting osteoporosis. Data relat...

Publication date: 2009